AITopics | delta table

ReeFRAME: Reeb Graph based Trajectory Analysis Framework to Capture Top-Down and Bottom-Up Patterns of Life

Gudavalli, Chandrakanth, Zhang, Bowen, Levenson, Connor, Lore, Kin Gwn, Manjunath, B. S.

arXiv.org Artificial Intelligence, Oct-18-2024

In this paper, we present ReeFRAME, a scalable Reeb graph-based framework designed to analyze vast volumes of GPS-enabled human trajectory data generated at a frequency of 1 Hz. ReeFRAME models patterns of life (PoL) at both the population and individual levels, using Multi-Agent Reeb Graphs (MARGs) for population-level patterns and Temporal Reeb Graphs (TERGs) for individual trajectories. The framework's algorithmic complexity is linear in the number of time points, which keeps anomaly detection scalable. We validate ReeFRAME on six large-scale anomaly detection datasets, simulating real-time patterns with up to 500,000 agents over two months.

artificial intelligence, data mining, reeb graph

doi: 10.1145/3681765.3698452

arXiv: 2410.14913

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.06)
North America > United States > Connecticut (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Services (0.68)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Scalable Vector Search for AI Apps with Milvus and Databricks

#artificialintelligence, Oct-14-2022, 00:15:21 GMT

Multi-modal embeddings are all the rage these days. Everyone wants a piece of them because they give you a way to convert unstructured data (images, text, audio, video, and so on) into representations that capture its semantic content. These representations are vectors that can serve a variety of use cases, including image similarity, deduplication, anomaly detection, text similarity, audio classification, and video understanding. To top that off, you don't have to be a data scientist with deep ML expertise to build these systems, nor do you need large amounts of data to start leveraging them. This is fine until you run into actual "hands on the keyboard" work for production.

databricks, milvus, scalable vector search

Technology:

Information Technology > Data Science > Data Mining (0.35)
Information Technology > Artificial Intelligence > Natural Language (0.35)

Benchmarking Amazon EMR vs Databricks

#artificialintelligence, Feb-18-2022, 13:10:14 GMT

At Insider, we use Apache Spark as the primary data processing engine to mine our clients' clickstream data and feed ML-ready data into our machine learning pipelines to enable personalization. We have been using Spark since version 1.5 and are always looking for ways to improve efficiency. If you are interested too, check out our blog post about how Spark 3 reduced our Amazon EMR cost by 40%. To further improve our platform's efficiency, we decided to conduct a trial with the Databricks platform. Before moving on to the Databricks platform and the benchmarks, let's look at how we use Apache Spark and Amazon EMR, and at the pain points, to better understand our current solution and its challenges.

amazon emr, databricks, delta table

Industry: Information Technology (0.50)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.72)

Machine Learning Data Lineage with MLflow and Delta Lake - Databricks

#artificialintelligence, Aug-21-2020, 04:51:18 GMT

Then we will show a live demo of how to use the various versioning features of these two frameworks to achieve data lineage in the machine learning process. We know that machine learning development is complex. To give a sense of it, this is a typical machine learning pipeline. You take your raw data and do some ETL, featurization, or data prep. Then you train on this data to produce a model, and you deploy that model to production.

artificial intelligence, delta lake, machine learning

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)
